Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing
نویسندگان
چکیده
Initiated mainly from speech community, researches in speech to speech (S2S) translation have made steady progress in the past decade. Many approaches to S2S translation have been proposed continually. Among of them, corpus-dependent statistical strategies have been widely studied during recent years. In corpus-based translation methodology, rather than taking the corpus just as reference templates, more detailed or structural information should be exploited and integrated in statistical modeling. Under the statistical translation framework that provides very flexible way of integrating different prior or structural knowledge, we have conducted a series of R&D activities on S2S translation. In the most recent version, we have independently developed a prototype Chinese-English bi-directional S2S translation system with the supports of multilingual speech recognition and bilingual-Chunk based statistical translation techniques to meet the demand of Manos – a multilingual information service project for 2008 Beijing Olympic Games. This paper introduces our works in the research of multilingual S2S translation.
منابع مشابه
Automatic speech recognition framework for multilingual audio contents
Automatic speech recognition (ASR) for multilingual audio contents, such as international conference recordings and broadcast news, is addressed. For handling such contents efficiently, a simultaneous ASR is promising. Conventionally, ASR has been performed independently, namely language by language, although multilingual speech, which consists of utterances in several languages representing th...
متن کاملFast Calculation of Translation Model Score for Simultaneous Automatic Speech Recognition of Multilingual Audio Contents
This paper addresses automatic speech recognition (ASR) for multilingual audio contents, such as international conference recordings and broadcast news. For handling such contents efficiently, a simultaneous ASR is promising. Conventionally, ASR has been performed independently, namely, language by language, although multilingual speech, which consists of utterances in several languages represe...
متن کاملServices to Support Use and Development of Speech Input for Multilingual Multimodal Applications for Mobile Scenarios
Speech is our most natural form of interaction. Developing speech input modalities for several languages, combining speech recognition and understanding, presents various difficulties. While automatic translators ease the translation of normal text, the adaptation of grammars for several languages is currently performed based on an ad hoc approach. In this paper, we present a novel service that...
متن کاملA Trainable Approach for Multi-Lingual Speech-To-Speech Translation System
This paper presents a statistical speech-to-speech machine translation (MT) system for limited domain applications using a cascaded approach. This architecture allows for die creation of multilingual applications. In this paper, the system architecture and its components, including the speech recognition, parsing, information extraction, translation, natural language generation (NLG) and textto...
متن کاملAn Efficient Unified Extraction Algorithm for Bilingual Data
The paper presents a unified algorithm for aligning sentences with their translations in bilingual data. The sentence alignment problem is handled as a large-scale pattern recognition problem similar to the task of finding the word sequence that corresponds to an acoustic input signal in isolated word automatic speech recognition (ASR). The algorithm gains efficiency from related work on dynami...
متن کامل